鈴木商店
鈴木商店 SUZUKI SHOTEN
Home Projects Series Blog Archive 🇯🇵 JP Contact
🇯🇵 JP
Home Projects Series Blog Archive
Contact on LinkedIn
Home / Blog / LLM Comparison
LLM Comparison

1 article tagged with "LLM Comparison"

Part 2. Why the AI Hardcoded the Expected Answers — Comparing Three Models for Code Generation Series

Part 2. Why the AI Hardcoded the Expected Answers — Comparing Three Models for Code Generation

We gave three AI models only the rule specification (in Japanese text) and asked them to write Python check code on their own. Comparing Gemini Flash, Claude Haiku, and Claude Sonnet, we found Gemini Flash had been hardcoding the expected answers.

2026-04-15 Read →

All Tags

AI (53) AI Agent (6) AI Infrastructure (3) AI Security (1) AWS (3) Advertising (2) Agent (3) Alibaba (2) Amazon (3) Android (2) Anthropic (3) Arm (1) Bluetooth (1) Business (16) ByteDance (1) CXL (1) China (3) Claude (2) Cloud (1) Cloudflare (1) Code Generation (1) Coinbase (1) Computer Vision (8) Cost Management (1) Cost Optimization (1) Cryptocurrency (1) Cybersecurity (2) DRAM (1) Data Center (5) Edge AI (1) Edge Computing (1) Education (3) Energy (1) Ethics (1) Finance (1) Fine-tuning (6) Firestore (1) GAN (7) GPU (2) GTC (1) Game Dev (3) Gemini (4) Gemma3 (1) Generative AI (3) Geopolitics (1) Git (1) GitHub (1) Google (2) Google Cloud (2) Google Colab (1) Google Maps (2) HBM (1) Hetzner (1) Hyperscaler (1) IPO (1) Incident (1) Infrastructure (15) Investment (11) Java (2) LLM (11) LLM Comparison (1) LoRA (2) Machine Learning (14) Memory (1) Meta (3) Micron (1) Microsoft (2) Mobile (2) Multi-Agent (1) MySQL (1) Mythos (1) NLP (6) NVIDIA (5) Next.js (5) Ollama (1) Open Source (1) OpenAI (8) OpenClaw (5) PHP (5) Palantir (1) Payments (1) Phaser 3 (3) Python (16) Real-Time (2) SK hynix (1) SaaS (4) Samsung (1) Security (2) Semiconductor (3) Semiconductors (1) Sensors (2) SoftBank (2) Space (1) SpaceX (2) Stablecoin (1) Stock Prediction (5) Strategy (1) Streaming (1) Stripe (1) System Design (1) TTS (1) Tencent (2) Tesla (1) Translation (2) Twitter API (1) TypeScript (3) Vertex AI (1) Virtual Try-On (3) White-Collar (1) Windows (1) heartbeat (1) iOS (1) xAI (1)
鈴木商店

鈴木商店

SUZUKI SHOTEN

Senrigan MarketQuest
© 2026 鈴木商店 / SUZUKI SHOTEN All rights reserved.
Search articles...